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ABSTRACT 

We derive probability density functions for the projected axial ratios of the real and mock 2PIGG 
galaxy groups, and use this data to investigate the intrinsic three dimensional shape of the dark matter 
ellipsoids that they trace. As well as analysing the raw data for groups of varying multiplicities, a 
convolution corrected form of the data is also considered which weights the probability density function 
according to the results of multiple Monte-Carlo realizations of discrete samples from the input spatial 
distributions. The important effect observed is that the best fit distribution for all the raw data is 
a prolate ellipsoid with a Gaussian distribution of axial ratios with (3 = 0.36 and a = 0.14, whilst 
for the convolved data the best fit solution is that of an oblate ellipsoid j3 — 0.22 and a = 0.1. 
Previously only prolate distributions were thought compatible with the data, this being interprated 
as evidence of filamentary collapse at nodes. We also find that even after allowing for the sampling 
effects, the corrected data is better fit using separate multiplicity bins, which display a trend towards 
more spherical halos in higher multiplicity groups. Finally, we find that all results in the real data are 
in good agreement with the mock data from ACDM simulations, KS tests showing that all comparative 
data have been drawn from the same distributions within the la confidence limits. 
Subject headings: galaxies: clusters: general; galaxies: halos 



1. INTRODUCTION 

The spatial distribution of galaxies traces the shape of 
the dark matter potential in which they are embedded. 
Simulations show that dark matter halos are not spheri- 
cal, as one would naively expect because of dark matter's 
non-dissipational natur e, but are strongly f lattened tri- 
axial eUipsoids (Dubinski fc Carlberg II1991I ). Although 
prolate and oblate shapes are equally likely in the simu- 
lations, dissipative infall of baryonic gas eventually forces 
the halo shapes tow ard pronounced oblateness with ax- 
ial ratios b/a > 0.7 ()Katz fc Gunnlll991l: [Dubinskilll994l : 
ICombes 20021. 

Because dark matter structures evolve self-similarly 
it is possible to derive the general shape of dark ha- 
los from analysis of single test objects. The first 
such attempts were carried out on rich clusters, whose 
shapes were found to be triax ial and strongly prolate 
(HI ionis. Barrow fc Frenk|[l99l[ ): however the non-linear 
evolution of these objects may play a role in sha ping their 
density distributions (e.g iBinnev fc Sil5ll979l ). Indeed 
there is evidence that the more high-density systems are 
more spherical th an low-density object s (random gaus- 
sian field work of iBardeen et al.l Il986l ) . The observed 
trends are the opposite of what is observed iii simu la- 
tions fe.g. lKasun fc Evrardl[2005HAllgood et al.ll2006f) . 

The distribution of Milky Way satellites shows 
a strongly oblate and flattened distribution 
(|Ruzicka. Palous fc TheisI l2007f) . which is also con- 
sistent with analy sis of the orbital planes of the 
Sagittarius dwarf (jJohnston. Law fc Maiewski I l2005fl 
and the Monoceros stream (jPenarrubia et al T 120051) . 
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However, star counts ()Lemon et al.ll200^ an d analysis of 
the s tellar stream of the Sagittarius dwarf (jlbata et al.l 
l2001h support a spherical halo. It is also uncertain 
whether the dwarf satellites can be used as test par- 
ticles, as they may not originate from c osmological 
sub-structures (iKroupa. Theis fc Boilvl[2005[ ). 

Groups of galaxies are likely to be the best testbeds 
to study the shapes of dark matter halos. Large 
group catalogs are now available from redshift sur- 
veys (e.g., th e 2dF Per c olatio n Inferred Galaxy Groups 
[2 PIGG1 of lE ke et al.' l2004l a nd the group catalog 
of IMerchan fc Zandivare j l2005l from Data Release 3 
of the Sloan Digital Sky Survey) and these allow 
a statistical approach to the study of the shapes 
of groups and the shape dependence on richness, 
multiplicity and dynamical evoluti on. Early studies 
marginally favoured prolate shape s (jFasano et al.l 119931 : 
lOrlov. Petrova fc TarantaevI l200il ). but not at the ex- 
clusion of oblate solutions. The most recent study of 
2PIGG groups bv lPHonis. Basilakos fc Ragone-Figueroal 
()2006l ) favours prolate groups, but is obtained by means 
of a multiplicity cut and as such represents a different 
method to the one presented here. Prolate results indi- 
cate that the original (oblate ?) shapes may have been 
strongly modified during gravitational collapse, or that 
filamentary collapse is in fact being witnessed. 

In this paper we carry out a detailed analysis of the 
shape of 2PIGG groups, coupled with extensive Monte 
Carlo simulations and comparison with groups ex t racted 
from the 2dF mock catalogs used by lEke et all ()2004D 
to optimize group selection. In section 2 we present our 
method for analysing the shape of observed groups and 
in section 3 apply this to the real and mock 2PIGG cat- 
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alog ()Eke et al.ll2004f) . Comparisons are made between 
the raw data and data that is corrected for the finite 
(indeed, often sparse) sampling by factors which we de- 
termine from Monte-Carlo simulations of suitably popu- 
lated groups. Furthermore, the mock and real data are 
considered separately and KS tests are used to confirm 
their distributional similarities. 

2. MODELLING SHAPE DISTRIBUTIONS 

To extract useful information from our projected group 
shapes we first make some simpli fying assumptions . 
With some theoretical justification ping fc Sutol |2003) 
we assume that our groups have three orthogonal axis 
with two of equal length, but allow our groups to be 
either prolate or oblate. This greatly simplifies the mod- 
elling compared to the case of general triaxial shapes. 
The problem is generally one of inversion, for which there 
is no unique solution in case of triaxial shapes. However, 
it is still possible to recover whether the distribution is 
more prolate or oblate-like, as will be discussed later. 

For now, we shall also make the assumption that we 
are able to measure the apparent group axial ratios per- 
fectly. Obviously this is not the case in practice since 
the group projection is represented by discrete galaxies, 
but we consider the effect of this in detail later. Figure [1] 
describes the simplified problem we are considering. In 
the diagram shown the object is oblate and symmetric 
about the z axis. C is where the tangent to the observer 
(at the point xq, zq in the x,z plane) cuts the z-axis, A 
is the distance between the origin and the tangent to the 
observer along the line normal to the tangent, a is where 
the ellipsoid cuts the z-axis and 9 is the angle between 
the z-axis and the tangent to the observer. The axial 
ratio for this projected image is found by considering the 
distance between the tangent lines to the observer (2A) 
and the larger axis that is projected normal to this on the 
sky (twice the radius of the ellipsoid in the x,y plane). 
The ratio between these values will tell us the observed 
axial ratio for each given 9 and ellipsoid shape. 

Knowing this we can investigate what the probability 
density function (PDF) of axial ratios would be for any 
given ellipsoid evenly sampled in spherical coordinate pa- 
rameter space. For a prolate or oblate object we have the 
standard form 



{uxf + {uyf + = a^ 



(1) 



where u > 1 for a prolate ellipsoid, u < 1 for an oblate 
ellipsoid, and a would be the radius of the sphere for the 
case u = \. 

We can then calculate the projected shape of the el- 
lipsoid on the plane of the sky when viewed at an ar- 
bitrary angle 6 by generalizing the results on the pro- 
jected e l liptici ti es of disc or e lhptical galaxies from, e.g., 
Hubbld (Il92l. ISandage Freeman fc Stokes (197Q ) and 
Milhalas fc Binneyl ( 198ll ): see appendix for details. The 
projected axis length A (Fig. [1]) is given by 



(2) 



so the apparent axial ratio q (defined to be less than 1 
for both oblate and prolate shapes: i.e. 9 = ^ or ^ 
respectively) is given by 



V? SIT? 9 + cos^ 9^q^ {oblate) 
— -^(prolate) 

r 

If we have a distribution of intrinsic shapes given by 
n{u) and random orientations then we can show (again 
see appendix for details) that the distribution of apparent 
axial ratios will be 



fiq) = q 



n{u)du 



[(1-U2)(q2_u2)]0.5 
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since only values oi u < q can contribute. If we write 
the intrinsic axial ratio as /3 < 1 (so /3 = w for oblate 
and /3 = 1/u for prolate shapes), then for a given /3, the 
number of systems with an observed q between, say, qi 
and q2 for oblate and prolate ellipsoids respectively is 
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So for a distribution of true axial ratios n{P), the ap- 
parent axial ratios for oblate ellipsoids follow 



N{qi,q2) 
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and for prolate ellipsoids 
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The simplest distribution to try would obviously be 
a delta function n(/3) = S{Pi) for some Pi, however a 
more sensible assumption for the underlying axial ratio 
function (or at least the first that should be tried) would 
be a Gaussian distribution, where 



df3 



n(/3) 



;exp 



2a2 



(7) 



We will use this model in what follows. 



3. METHODOLOGY 



The largest publicly available group samples are the 
2dF Percolation-Infer red Galaxy Groups (2PIGG) cat- 
alog (|Eke et all [20041 ). selecte d from the 2dF G alaxy 
Redshift Survey (2dF-GRS - IColless et all [200ll) . the 
Yang 2dF group catalog (Yang ct al. 2005) , and the Yang 
SDSS group catalog ( Weinmann ct al. 2005.) . For the 
purposes of this paper only the 2PIGG catalog and its 
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Fig. 1. — Geometric description of ellipsoid viewing problem. 



associated mock catalog will be used; the other catalogs 
will be used in a more comprehensive paper (in prepa- 
ration) comparing the different grouping methods. As 
in our previous work (Robotham ct al. 2006) we begin 
by selecting all groups from this catalog with 0.05 < 
z < 0.10. The z — 0.05 lower limit is motivated by the 
small volume sampled by 2PIGG at low redshift (so that 
groups may not be representative), while at z = 0.10 the 
2dF GRS apparent magnitude limit begins to exclude 
even moderate luminosity galaxies {Mb < —18) from 
the sample (making the members less representative). 

It is not trivial to decide how many discrete objects you 
require in an elliptically distributed sample to accurately 
determine the underlying shape. To resolve this problem, 
Monte-Carlo tests were performed using a range of initial 
spatial distributions randomly populated by a varying 
number of discrete points . Using the moments method of 
ICarter fc Metcal^ (llQSOT ) to determine ellipticity (found 
to be pre ferential to the flattening technique o f (Roocl 
11979( 1 in ()Plionis. Basilakos fc Tovinassianll2004f )). these 
simulated discrete points were analysed and compared 
to the true elliptical distributions that produced them. 
This method of simulating and measuring was carried 
out 1,000 times for each multiplicity of group distributed 
with an ellipticity defined by e = 1 — /? (i.e. an ellipticity 
of would be a circle). The multiplicity was varied in 
steps of 5 up to 300, and e in steps of 0.05 from up 
to 0.95. With this extensive sample available, compar- 
isons of standard deviations in determined Eo for different 
simulated multiplicities were made versus the intrinsic Si 
used to generate the samples. It was discovered that 
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Fig. 2. — Relative error of measured axial ratio, overplot with 
the best fit function. 

the observed ellipticity correlates almost linearly with 
the true ellipticity, error distributions are very close to 
Gaussian, and the standard error for a measured ellip- 
ticity is directly proportional to the axial ratio (when 
defined to be < 1). As we find a near linear correlation 
between the observed and the true ellipticity, by fitting 
to the distribution of the fractional differences between 
the intrinsic and observed ellipticities as a function of 
the multiplicity (n) we can correct the ellipticity. The 
empirical correction function is 



' \^0.99294 - (<L224438) _ (CLngii) + 1.22467 x IQ-^N 

However, this systematic correction is not required 
usually since it is so small compared to the standard 
error of the distribution of Ei for a given Eq- Using the 
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same data, a fit has been made of the relative axial ratio 
error against multiplicity (see figure [2|) . The standard 
error is given by 

/ /2.0119\ /8.3096\ , \ 

a^ = f3 (^0.0792 + [-^) + [-^) ~ ^'^^ x IQ-^iV j 

(9) 

This function is plotted against the data in figure O 
From this we see that independent of the underlying ej, 
20 objects are required to have a relative axial ratio error 
of 20%. If we have 20 objects in a group with measured 
elhpticity of 0.2 (/3 = 0.8), cr will be 0.16, whilst for a 
more elliptical group with eo = 0.6 {(3 — 0.4), the er- 
ror will be 0.08. There is obviously an issue with the 
observed distribution of ellipticities as we move towards 
0; a simple but accurate model is that the distribution 
remains Gaussian into negative values for the elhpticity, 
but these probabilities are mirrored back across the ori- 
gin. Accordingly, the distribution about a true ellipticity 
of will appear to be a Gaussian cut in half, and the dis- 
tribution about a true ellipticity of 0.4 is almost a pure 
Gaussian with a as stated. 

So far we have considered the correction required for 
each individual observation, i.e. the most likely intrin- 
sic ellipticity for a particular observed ellipticity for a 
certain multiplicity. In practice the rather complex dis- 
tortions for axial ratios near unity are best accounted 
for by considering the amount of distortion between the 
input and measured axial ratio distributions. Using this 
Monte-Carlo data it is trivial to find the correction fac- 
tor between a uniform PDF of input axial ratios and 
the measured non uniform distribution for each multi- 
plicity (see Fig. [3|). This technique has the advantage of 
correcting for the lack of axial ratios ~ 1 in low multi- 
plicity groups, and we don't lose any data, it is merely 
weighted depending on the significance of the measure- 
ment. This makes intuitive sense because if we had 10 
low multiplicity groups, 5 of which had axial ratios near 1 
and 5 have small values, overall the correct interpretation 
would be that the groups have close to circular true axial 
ratio since these are the axial ratios that are suppressed 
(low values of the PDF), and thus carry more significance 
when actually found. Figure [4] demonstrates the effect of 
this correction on a generated distribution of points for 
both an oblate and prolate ellipsoid with differing mul- 
tiplicities. For the prolate group of 20 objects with an 
axial ratio of 0.5 the best fit was with the corrected data, 
returning a prolate fit with mean axial ratio of 0.50 (re- 
duced of 6.25 compared to 27.5 for the uncorrected 
data). For the oblate group of 20 objects with an ax- 
ial ratio of 0.5 the best fit was with the corrected data, 
returning an oblate fit with mean axial ratio of 0.48 (re- 
duced of 2.04 compared to 63.37 for the uncorrected 
data). In the later case the result was particularly sig- 
nificant because the oblate fit was strongly rejected by 
the raw data alone, despite us knowing that this should 
be the returned best fit distribution. These results are 
encouraging since the convolution correction is best ap- 
plied to a large amount of input data, such as the catalog 
data that we have available. The caveat when using this 
technique is that extremely sharp features will always be 
smoothed out in low multiplicity data since the axial ra- 
tios themselves are not re-binned, only re-weighted; this 
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Fig. 3. — Observed PDF for different multiplicities plotted to- 
gether with the uniform input PDF used for the Monte-Carlo tests. 

effect being obvious in [H The type of distribution and 
the mean are reliably recovered however, and this is the 
the matter of most importance. 

These two different techniques, simply correcting the 
means of the distributions for sampling effects or using 
the whole PDF, can be used to produce two different dis- 
tributions of axial ratios. When used on test data they 
display good agreement for large multiplicity samples; di- 
vergence at large axial ratios is always present however, 
since for any multiplicity the Monte-Carlo tests strongly 
suggest axial ratios near 1 are observationally inhibited. 
This is significant since it is the axial ratios near 1 that 
help us to distinguish between prolate and oblate under- 
lying distributions: oblate distributions plateau at large 
axial ratios, whilst prolate distributions will drop to ex- 
tremely low values. 

Considering the first of these two methods for creating 
our PDFs, a cut-off was applied to the sample, requiring 
a minimum of 20 galaxies in order to achieve an accept- 
able level of confidence in the e values returned. In addi- 
tion, a lack of sufficient multiplicity will always have the 
effect of increasing the ellipticity measured; the obvious 
extreme example would be a multiplicity of 2, where the 
measured ellipticity will always be 1 , no matter the distri- 
bution that created them. Obviously, the ideal situation 
would be to increase the multiplicity limit further, and 
it is true that this would increase the confidence in our 
measured ellipticities, but two factors militate against 
this: first we are interested in a range of group multi- 
plicities, so would like to consider as broad a sample as 
possible, and second we need as many measured elliptici- 
ties as possible in order to produce a well defined binned 
PDF [N{q)). The remaining groups were analysed in the 
same fashion as the Monte Carlo models and sorted into 
axial ratio bins of size 0.05 from to 1, creating a binned 
PDF that can be compared to the distributions predicted 
for the models in Section 2. 

For the second method all group multiplicities are used, 
the only limitation being those of the original data set; 
i.e. multiplicities of 5 or more. A convolving routine 
was written in order to weight the significance of each 
result, which is a combination of the multiplicity and the 
axial ratio measured. These results were then placed in 
the same bins as used for the previous technique, allow- 
ing for direct comparisons. To aid analysis Monte-Carlo 
distributions were also created, these correspond to the 
PDF that would have been observed had the true PDF 
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Fig. 4. — Raw and corrected PDF plotted against the input ellipsoid data for a given group using the HEALPix method to sample the 
rotation parameter space evenly. On the left is a prolate group of 30 objects with an axial ratio of 0.5. On the right is an oblate group of 
10 objects with an axial ratio of 0.5. 



of axial ratios been uniform for each group multiplicity. 

As a point of comparison figure [5] shows how our Local 
Group appears projected on the sky at different mag- 
nitu de cuts (data taken from iPritchet fc van den Berghl 
Il999f). evenly sampled using the HEALPix isolatitude 
pixelization method to ensure a representative distribu- 
tion of convolution corrected and non corrected projec- 
tions of the Local Group in our rotation parameter space. 
The effect of the correction is most evident on the low 
magnitude cuts where the plateau at larger axial ratios 
is suppressed in the raw data, but after correction the 
plateau continues more evenly. This suggests the Local 
Group is oblate at low magnitude cuts. The extremely 
small axial ratios at high magnitudes is understandable 
in the context of the Local Group having significant sub- 
structure, and at high magnitude cuts only the regions 
around the Milky Way and M31 contribute. This creates 
a dumbbell structure that is increasingly smoothed out 
as fainter members are added, hence the systematically 
larger axial ratios as the magnitude limit is increased. 

4. CONTAMINATION BY INTERLOPERS 

A problem for any attempt to collect galaxies into as- 
sociated groups and clusters is the unavoidable presence 
of interlopers, as such it seems prudent to quantitatively 
define what the effect is. The situation is improved by 
the use of spectroscopy since this requires interlopers to 
have chance projections and chance recessional velocities, 
however it is clear this still won't eliminate all interlop- 
ers, just reduce the percentage of the population that are 
false. The 2PIGG catalog is spectroscopically s elected, so 
it is as good as reasonably possible. Figure 4 of lEke et al.l 
(|2004f ) shows three plots to describe the typical interloper 
rates in groups of different mass, redshift and multiplic- 
ity. As might be expected, interloper rates get worse with 
the largest multiplicities (hence mass) and at the high- 
est red shifts, however the redshift cut we have applied 
at 0.1 indicates a very steady interloper rate of 20% to 
30%. Over our regime of group masses (~ lO^^M© to 
~ lO^-^Mo) and multiphcity (5 to 163) the 2PIGG cat- 
alog is also predicted to maintain a consistant interloper 
rate. 

Since interlopers are an inevitable feature of our cata- 
log it is of paramount importance to discern what effect 
they have. Assuming a given group has an intrinsic shape 
seen in projection, the consequence of interlopers will be 
to add a circularly distributed population on top of any 



projection of the same maximum allowed radius. This 
is obvious since any grouping algorithm merely considers 
the magnitude, radial separation and, possibly, redshift 
of any potential new member. Intrinsically no restriction 
is made as to how objects affect the shape of the currently 
grouped objects. With this in mind a Monte-Carlo sim- 
ulation was designed to find how mixing together inrinsi- 
cally shaped ellipsoids of oblate and prolate varieties with 
an interloper population can distort the observed results. 
Of particular interest is how the same rate might affect 
different multiplicities. 

For the sake of interest a large variety of interloper 
rates were investigated, from 2% to 50%, although our 
main interest will be the interloper rate ^ 20%. Mul- 
tiplicities of 5, 10, 20, 50 and 200 were chosen to allow 
for population comparisons, and axial ratios of 0.05 upto 
0.95 were generated. For each Monte-Carlo distribution 
(a given interloper rate, multiplicity, ellipsoid type and 
axial ratio) 20,000 virtual groups with interlopers and 
20,000 groups without interlopers were generated (the 
number of runs was deemed more than adequate in pre- 
tests). KS-tests were carried out between all populations 
of interloper and non-interloper groups and the best fit 
solution determined. The result of this is a direct mea- 
sure of how interlopers can distort what is observed, and 
can be used as a transformation guide to find the real 
shape of the underlying ellipsoid. 

The main finding is that for all axial ratios the inter- 
loper groups are consistantly observed to be more spher- 
ical than their underlying population. As might be ex- 
pected, the effect is weakest for the lowest interloper rates 
and vica- versa (see Figure [6|). Perhaps the most inter- 
esting aspect to note is that the lower the multiplicity 
the smaller the distortion to the interloper distribution 
compared to the normal one, whether oblate or prolate. 
Another interesting outcome of these simulations is we 
find a hard limit for the observable ellipsoid as a func- 
tion of multiplicity and interloper rate (see Figure [7] for 
the prolate limits). The observable limit is smallest for 
groups with the lowest interloper rates and multiplici- 
ties, and worst for those with the highest interloper rates 
and highest multiplicities. The dependence on interloper 
rates is how one might expect, but the relation to mul- 
tiplicity might not be so obvious. These trends are ob- 
served strongly for both prolate and oblate shapes. 

A significant issue is the confusion effect: does the 
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Fig. 5. — Local group projected axial ratios at different V magnitude cuts using the HEALPix method to sample the rotation parameter 
space evenly. On the left are the uncorrected distributions. On the right are the convolution corrected distributions. 




Fig. 6. — The effect a 20% interloper rate has on prolate (left) and oblate right) best fit distributions. 
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Fig. 7. — Absolute detectable limits for given multiplicities and 
interloper rates. 



presence of interlopers make a prolate distribution look 



oblate and vica-versa, and how does multiplicity and the 
interloper rate affect the possibility of confusion? When 
confusion occurs it is generally at the largest axial ra- 
tios, an understandable result since as prolate and oblate 
shapes move closer to spherical it requires a smaller 
amount of random distortion to transform one distribu- 
tion into the other, and hence it doesn't affect prolate or 
oblate groups worse. 

It becomes easier to distinguish populations when the 
axial ratios are lower 0.5), however this improvement 
is lost when axial ratios are very small (less than the 
effective observable limit) because the distributions be- 
come very hard to fit at all. The distortion is particularly 
evident for higher multiplicities, as can be seen in Figure 
[51 Here all the input axial ratios are 0.1 and the inter- 
loper rate is only 5%, and yet the amount of distortion 
becomes quite significant as a function of multiplicity. In 
contrast the quality of fitting for axial ratios of 0.7 is very 
good even when interloper rates are at 20%, as is evident 
from the QQ-plots in Figure [H 

The last finding of note is that interloper rates alone 
account for some degree of distribution broadening, on 
top of the aforementioned shift to more spherical pop- 
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Fig. 8. — Distortion casued by 5% interloper rate for prolate groups of multiplicity 5 (top-left), 20 (top-right), 50 (bottom-left) and 200 
(bottom-right) . 



ulations. This is understandable simply because we 
are modeling a binomial distribution (every Monte-Carlo 
galaxy has a chance of being an interloper as described 
by the interloper rate), and as such there will be an as- 
sociated spread in observed shapes since when there are 
many interlopers a given group will appear more circu- 
lar and when there are fewer this effect is reduced. In 
fact the number of interlopers will follow a near gaus- 
sian distribution when np~ 5, so groups with multiplic- 
ity 20 and interloper rates of 20% almost meet these con- 
ditions. Otherwise we'll still observe a spread, just of a 
more discrete variety. This effect is clearly seen in Figure 
[5] where only the multiplicity 200 population undergoes 
a smooth distortion; this is because np= 10 and meets 
the requirements to approximate a gaussian form for the 
number of interlopers. The multiplicity 50 population 
has np= 2.5, so whilst the distribution is broadened it is 
not as smooth. The multiplicity 20 population has a very 
strong feature from 0.2-0.5. This is because np= 1 and 
as such the event of no interlopers is quite likely (36%), 



in fact it is almost exactly the same chance as one in- 
terloper (38%). Thus the two main features-the peak 
between 0-0.2 and the plateau from 0.2-0.5- are caused 
by interloper numbers of and 1 respectively. The slight 
hints of further plateaus beyond 0.5 are caused by inter- 
loper number beyond 1, and these only occur ^ 26% of 
the time. 

These results demonstrate that any observed gaussian 
distribution of ellipsoid axial ratios are, in general, de- 
scribing the upper limit of the axial axial ratio and the 
upper limit for the standard deviation. As the axial ratio 
is beyond the observing limit it is possible to make an 
estimate of the underlying distribution, bearing in mind 
the interloper rate and multiplicity. 

5. RESULTS 

The main results can be found in tables [T] and [21 and 
figures [TOl [Til [El and [131 for the raw and the corrected 
data respectively. The tables present parameter values 
for the best fit prolate and oblate distributions ob- 
tained, whilst the figures display the raw and corrected 
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Fig. 9. — QQ-plots for best fit interloper groups where interloper rates= 20% and axial ratio= 0.7. 



TABLE 1 

Best fit par.ameters for raw group data 



Multiplicity 1 




(T 




Reduced 


All groups (true oblate) 1 


0.16 


0.06 


1719.41 


95.52 


All groups (true prolate) 


0.36 


0.14 


74.62 


4.15 


All groups (mock oblate) 


0.14 


0.08 


1685.50 


93.64 


All groups (mock prolate) 


0.36 


0.16 


75.96 


4.22 


5-9 (true oblate) 


0.12 


0.06 


1836.35 


102.02 


5-9 (true prolate) 


0.34 


0.14 


66.71 


3.71 


5-9 (mock oblate) 


0.1 


0.06 


3028.97 


168.28 


5-9 (mock prolate) 


0.3 


0.14 


79.57 


4.42 


10-19 (true oblate) 


0.24 


0.04 


276.33 


17.27 


10-19 (true prolate) 


0.42 


0.12 


33.99 


2.12 


10-19 (mock oblate) 


0.26 


0.1 


194.30 


12.14 


10-19 (mock prolate) 


0.42 


0.14 


23.60 


1.48 


20+ (true oblate) 


0.3 


0.08 


103.66 


6.91 


20+ (true prolate) 


0.46 


0.14 


15.97 


1.06 


20+ (mock oblate) 


0.3 


0.1 


221.27 


15.81 


20+ (mock prolate) 


0.46 


0.12 


23.32 


1.67 



TABLE 2 

Best fit parameters for convolution corrected group 
data 



Multiplicity 1 




cr 


Reduced 


All groups (true oblate) 1 


0.22 


0.1 


28.47 


1.58 


All groups (true prolate) 


0.44 


0.18 


110.27 


6.13 


All groups (mock oblate) 


0.22 


0.14 


50.71 


2.82 


All groups (mock prolate) 


0.42 


0.18 


60.49 


3.36 


5-9 (true oblate) 


0.2 


0.1 


14.47 


0.80 


5-9 (true prolate) 


0.42 


0.2 


110.07 


6.12 


5-9 (mock oblate) 


0.16 


0.12 


38.09 


2.12 


5-9 (mock prolate) 


0.4 


0.2 


65.54 


3.64 


10-19 (true oblate) 


0.3 


0.1 


45.57 


2.85 


10-19 (true prolate) 


0.44 


0.14 


42.79 


2.67 


10-19 (mock oblate) 


0.28 


0.1 


27.00 


1.68 


10-19 (mock prolate) 


0.44 


0.16 


33.96 


2.12 


20+ (true oblate) 


0.32 


0.1 


38.31 


2.55 


20+ (true prolate) 


0.46 


0.14 


12.74 


0.85 


20+ (mock oblate) 


0.36 


0.1 


19.03 


1.36 


20+ (mock prolate) 


0.48 


0.14 


9.91 


0.71 



data overplotted with these best fit distributions along 
with their associated error eUipses. 

Firstly, we will consider the raw data with a multi- 
plicity cut-off of 20 (see figure [T3|) , since for the multi- 
plicity range described it will be the most reliable raw 
result and immediately comparable to other research. 
Assuming Gaussian PDF for either oblate or prolate el- 
lipsoids, tests were undertaken using the numerical 
integration capacity of Maple to an accuracy of 0.02 
for both the mean (0.1 to 0.7) and the standard devi- 
ation (0.06 to 0.3). The minimum solution found the 



groups to be prolate to a high degree of confidence (the 
minimal prolate solution returned 15.97 compared to 
103.66 for oblate), with a Gaussian distribution of mean 
P = 0.46 and standard deviation ap = 0.14. These 
values agr ee extremely well with those recent l y pre - 
sent ed by Plionis, Basilakos fc Ragone-Figueroal (|2006[ ) 
and lPaz et al.. (20061 ). For this data cut we have 15 de- 
grees of freedom giving a reduced of 1.06, which is 
within 1(7 expectations. 

The parameter gradient for a prolate ellipsoid distri- 
bution is very steep for the raw data of groups with mul- 
tiplicity between 5 and 9. The results strongly imply 
the distribution being narrow (0.12 < ap < 0.15), and 
restricted between (3 values of 0.44 to 0.48. More signifi- 
cantly the distribution is extremely far removed from any 
possible oblate distribution. This is intuitively obvious 
from the lack a plateau at the near circular extreme of 
the binned axial ratio PDF This is an effect that will al- 
ways be present in oblate ellipsoids due to the observed 
axial ratio always having to be greater or equal to the 
intrinsic axial ratio (true also for prolate ellipsoids), and 
because an oblate ellipsoid will possess a greater observed 
axial ratio (g) when compared to a prolate ellipsoid shar- 
ing the same intrinsic axial ratio (/?) inclined from our 
linc-of-sight by the same degree. The fact the measured 
binned PDF falls to almost zero for circular value axial 
ratios strongly precludes an oblate contribution except at 
insignificant levels. It is due to this effect of the observed 
axial ratio, _at a minimum, being equal to the intrinsic 
value that /? will always be smaller than the peak in any 
observed PDF, hence in this case /? is smaller than the 
obvious peak in the binned PDF at 0.6. 

The other raw measurements for different multiplicity 
groups are presented in the corresponding graphs and ta- 
bles, however such is the inherent biasing in these lower 
multiplicity data that little can be gleaned from their 
analysis alone. Instead, our last use of the raw data is 
to determine the consistency between the real universe 
and the ACDM dependent mock universe. It is com- 
pletely justifiable to make a comparison between the raw 
data and their multiplicity associated mock data, since 
the shape biasing (being a sampling artifact) will be the 
same. By directly comparing the minimal x^ fits and 
considering the cross catalog KS tests we will have an 
insight into the reliability of both the grouping algorithm 
and the cosmological model used. When considering ta- 
ble [T] and table [3] we can see remarkabl y good agr eement 
between the real and mock data (c.f. iPaz et al] [20061 . 
Considering the fits, we find near perfect agreement for 
P between the real and mock data, the major disagree- 
ments being for multiplicities of 5-9 (for prolate distribu- 
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Fig. 10. — All group multiplicities. Top plot shows best fits for the real 2PIGG data on the left and the relevant parameter contour plots 
on the right, whilst the bottom plot is a comparison to the mock 2P1GG data. 



tions the real data has (3 = 0.34 compared io (3 = 0.30 
for the mock data) and for 10-19 (for oblate distribu- 
tions the real data has a ~ 0.04 compared to cr — 0.1 
for the mock data) . We find for the most part extremely 
similar between the two data sets, another indication 
that they are truly describing the same underlying dis- 
tributions. Furthermore, when looking at the KS test 
results we find that all direct comparisons can be drawn 
from the same underlying distribution to within Icr ex- 
pectations. The self similarities between the data for dif- 
ferent multiplicities are not quite as good (tables [4] and 
E]), however, the real and mock data both reject shared 
self similar distributions to a high degree of significance. 
As would be expected from the fitting results, the most 
similar distributions are those for multiplicities of 20+ 
and 10-20 for both the real and mock data (2.9% and 
0.14% respectively). It can be seen from the contour 
data that these multiplicities only agree upon parame- 
ters when considering the 2a error contours, which is 
consistent with this KS test statistic. 

We now consider the convolution corrected data with 
weighted histogram bins. The results for the real and 
mock forms of the 2PIGG catalog can be seen in figures 
[TUl [TTl \n\ and [T31 and in table [21 Significantly, we now 
find that the distributions are not necessarily prolate. 
The complete data and the multiplicity cut for 5-9 is 
better fit by an oblate distribution, the cut-off for 10-19 
is equally well fit by either, and only the 20-1- cut is bet- 
ter fit by a prolate distribution and not with a definite 
rejection of an oblate distribution. If oblate distribu- 
tions are accepted for the corrected data then we see a 
strong divide between the low multiplicity cut (5-9) and 
the other two larger multiplicity cuts. The former has a 



distribution mean j3 = 0.2 whilst the latter two have a 
mean (3 ~ 0.3: in all cases = 0.1. If we still assume 
prolate distributions, despite evidence of strong rejec- 
tion in our 5-9 multiplicity cut, tjien we still see a small 
variation in distribution mean: _(5 = 0.42 ± 0.01 for 5-9, 
13 = 0.44 ± 0.01 for 10-19 and (3 = 0.46 ± 0.02 for 2Q+. 
The latter two have = 0.14, whilst the 5-9 cut has 
ap — 0.2. Within the Icr error contours the two higher 
multiplicity cuts share the same distribution, (3 ~ 0.44 
and (jp = 0.14, whilst the 5-9 cut is strongly indicated 
to be a truly different distribution, lying outside the 3(t 
error contours. This finding is consistant with the previ- 
ous KS tests which showed the two high multiplicity cuts 
to be the most similar; once the distributions are cor- 
rected they become identical. Evidence of oblate shapes 
has been presented before in literature: iFasano et all 
(|1993l ) and lOrlov. Petrova fc TarantaevI ([20011) being no- 
table examples, though with a smaller sample size and 
different multiplicity limits respectively. These au- 
thors als o found a non-rejec table oblate distribution, 
although iFasano et al.l (Il993l) favo ured the prolate fit. 
lOrlov. Petrova fc TarantaevI (|2001 f) found an axial ratio 
mean of 3:1 for their sample of small galaxy groups (mul- 
tiplicities of 3-8) , a figure that can be considered consis- 
tent with our data {(3 — 0.2 for group multiplicities 5-9) 
given that the statistics are significantly poorer (2703 
galaxies in 485 groups). Prolate fits have been found to 
be the best fit for large cluster samples ([Cooravl |2000D 
which would be consistant with our 20-f multiplicity cut 
(these systems being the most comparable to cluster en- 
vironments). 

Regarding comparisons to the mock catalog we once 
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Fig. 11. — Group multiplicities of 5 to 9. Top plot shows best fits for the real 2PIGG data on the left and the relevant parameter contour 
plots on the right, whilst the bottom plot is a comparison to the mock 2PIGG data. 



TABLE 3 

KS TEST COMPARISONS BETWEEN TRUE AND MOCK DATA FOR DIFFERENT MULTIPLICITIES 



1 


Real raw All 


Real raw 5-9 


Real raw 10-19 


Real raw 20-1- 


Mock raw All 1 


68.63% 


1.16a;10-3% 


6.27 X 10-3% 


1.55 X 10-^% 


Mock raw 5-9 


4.59x10-4% 


27.11% 


5.72 X 10-"% 


5.04 X 10-"% 


Mock raw 10-19 


1.61 X 10-8% 


5.14x10-10% 


22.94% 


0.06% 


Mock raw 20-f 


3.43 X 10-13% 


4.02 X 10-21% 


0.07% 


67.49% 



TABLE 4 

KS TEST COMPARISONS BETWEEN SELF SIMILAR TRUE DATA FOR DIFFERENT 

MULTIPLICITIES 



1 


Real raw All 


Real raw 5-9 


Real raw 10-19 


Real raw 20-f 


Real raw All 1 


1 


N/A 


N/A 


N/A 


Real raw 5-9 


0.04% 


1 


N/A 


N/A 


Real raw 10-19 


1.11 X 10-3% 


5.35 X 10-1° 


1 


N/A 


Real raw 20-1- 


1.83 X 10-*^% 


8.27 X 10-12% 


2.90% 


1 



again see consistent fitting between the real and mock 
data, accounting for the corresponding error ellipses the 
fitting between the real and mock data is actually better 
than for the raw data. However, the mock catalogs do 
not find such similar distributions for the 10-19 and 20-1- 
cuts, either for oblate (/3 = 0.28, /? = 0.36 respectively) 
or prolate (/3 = 0.44, (3 = 0.48 respectively) solutions. It 
should be noted that a KS test cannot be done in this sit- 
uation since it is only applicable to data with unweighted 
histogram frequency points. 
Perhaps the most interesting aspect of this analysis is 



the tentative evidence it shows for generally more spheri- 
cal groups for larger multiplicities, not just as a numerical 
artifact. This would be consistant with large multiplic- 
ity groups being more virialised and dynamically more 
evolved, consistant with hierarchical formation of small 
groups collapsing along filaments and larger groups form- 
ing at nodes. The features that account for the high el- 
lipticity sub sample in low multiplicity groups are a low 
f3 for an oblate solution, and a large a for a prolate so- 
lution. Either way there does appear to be a low axial 
ratio population that must be accounted for, and that is 
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Fig. 12. — Group multiplicities of 10 to 19. Top plot shows best fits for the real 2PIGG data on the left and the relevant parameter 
contour plots on the right, whilst the bottom plot is a comparison to the mock 2PIGG data. 



TABLE 5 

KS TEST COMPARISONS BETWEEN SELF SIMILAR MOCK DATA FOR DIFFERENT 

MULTIPLICITIES 



I Mock raw All Mock raw 5-9 Mock raw 19-20 Mock raw 20-|- 

Mock raw All I 1 N/A N/A N/A 

Mock raw 5-9 2.45 X 10"^% 1 N/A N/A 

Mock raw 10-19 1.97 X 10-3% 1.06 X IQ-^^ l N/A 

Mock raw 20-1- 1.44x10""% 4.056 X 10-^^% 0.14% 1 



not present in the higher multiphcity data. It is worth 
remembering that at low magnitude cuts (corresponding 
to higher multiplicities) the Local Group demonstrates 
oblate characteristics, so this result is not necessarily 
suprising. This result is even more reliable occurring as 
it does in the region of the PDF that is least corrected 
in convolution. In fact in this region highly elliptical 
results for low multiplicity groups are reduced in weight- 
ing, so the fact the signal is still so strong indicates a 
real population difference. This trend would appear to 
be in the opposite sense to wh at is expected in simula- 
tions (see (lA llgood et al.|[200^ and references therein), 
but it should be noted that these results are much closer 
to agreement after correction. 

6. CONCLUSION 

We found good agreement between our results for raw 
data with a simple 20-1- multiplicity cut and findings by 
other authorSj indicating a strongly prolate distribution 
with a mean f3 — 0.46 and CT/j — 0.14. However, when fits 
were applied to convolution corrected distributions which 
allow for the error introduced by finite sampling we found 



that for the higher multiplicity cuts both oblate and pro- 
late distributions could produce reasonable fits. More 
significantly, evidence suggests that these large multiplic- 
ity populations share the same underlying distribution (if 
oblate: /5 ~ 0.3 = 0.1, if prolate /5 - 0.44 ct/j = 0.14). 
There is evidence in the data that low multiplicity groups 
have a highly elliptical sub population that is missing in 
other multiplicity cuts: for 5-9 multiplicity cuts oblate 
distributions produce a better quahty of fit (/? = 0.2, 
(T/3 — 0.1), this is despite the effect of the convolution 
correction being to reduce this signal. This is seen as ev- 
idence of high multiplicity groups being more spherical, 
and is consistant with them being located in nodes, or at 
least in a less filamentary structure than the extremely 
low multiplicity groups. Large multiplicity groups are 
better fit by a prolate distribution in the corrected data, 
but oblate fits are not as strongly rejected as in ear- 
lier work which did not account for the sampling bias. 
The effect of interlopers was extensively considered, and 
the results of this work imply all measurements obtained 
should be considered upper limits for both the under ly- 
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Best fit plots for raw and corrected data using oblate and prolate distributions 
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Fig. 13. — Group multiplicities of 20 upwards. Top plot shows best fits for the real 2PIGG data on the left and the relevant parameter 
contour plots on the right, whilst the bottom plot is a comparison to the mock 2PIGG data. 



ing axial ratios (i.e. the true non-interloper distribution 
will be more elliptical than that measured) and the stan- 
dard deviations of the populations. 

KS tests were utilised where appropriate and indicate 
that the real and mock catalogs are in very good agree- 
ment over all ranges. For all multiplicity comparisons the 
real and mock data share the same underlying distribu- 
tion to within la expectations, and the most self similar 
populations are for multiplicities 10-19 and 20-I-, which 
appear consistent with being part of the same overall 
population once the corrections have been applied. 

Further exploration of projected group shapes will be 



presented in a future paper which will consider the dif- 
ferences we find when colour cuts and different grouping 
algorithms are used. 
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APPENDIX 

For a prolate or oblate ellipsoid we know that 

[uxf + {uyf + ^a^ (10) 
where u > 1 for a prolate ellipsoid, and u < \ for an oblate ellipsoid. So in z-x plane 

{uxof + zl^a^ (11) 

zl = „ (ya,p)2 (12) 

zl2^-pL = -2^.^X0 (13) 

^From standard trigonometry 

tan (I - 6*) = cot(6') (14) 



so the gradient with respect to the observer is defined by 

5zq M^Xq 



and 



giving us 



= cot6' (15) 

OXq Zq 



(16) 







1 


- \l 




^0 


u^zl 


V? 




K4 



4^2 /„2 



/,From figure [T] we can see 



eot^^ = ^=.^^-lj (17) 
^-sin^ (18) 



C ^ = zn- XQ.m (19) 
smy 

where m is the gradient, and will necessarily be the opposite sign to xq. This gives the z value for the z-axis/tangent 
intercept. 

C = zo- xo = zo + !^ = — (20) 

zo zq zq 



Using p?]) we get 



^a^^)smH = a^^-^+sin'9] (21) 



2\ /„„o2 



COS^ 9 



(23) 



We define the apparent axial ratio (q) to be less than 1 for both prolate and oblate ellipsoids. Thus for oblate ellipsoids 
q — and for prolate ellipsoids 9 = So 

sin^ e + cos^ 9 = q^( oblate) = -^(prolate) (22) 

q^ 

To have g between q and g + (Jg, the symmetry axis has to lie at an angle between 9 and 9 + 69. 

\5q/S9\ 

Mathematically this means if q changes rapidly with respect to 9 then 69 is small. A simple example is to imagine 
a needle (an almost completely prolate object): when we view this end on and alter the angle, 6q/69 will be huge; 
our observed axial ratio q has changed from 1 (a perfect circle) to near (by this definition almost a line) over an 
infinitesimally small angle. So the range of angles over which we might say the axial ratio is similar (69) is very small. 
However, when we consider the same needle side on, and rotate as before, 6q/69 will be very small; q is changing very 
gradually. So for a prolate object viewed with the symmetry axis at an orthogonal angle, the range of angles possessing 
similar axial ratios is much larger than when viewed with the symmetry axis parallel to our line-of-sight. 

For a given u, and assuming a random orientation of symmetry axes for all the ellipsoids, the relative probability 
of observing the ellipsoid between two axial ratios (say qi and 52) is given by the relative magnitude of the cosine 
between angles 9i and 92 that correspond to these observed ellipsoids. This is exactly the same factor that has to 
be considered when trying to determine what fraction of the angular area of the celestial sphere is contained within 
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different declinations. As an example, if q varies the same amount between 0° and 10° as it does between 80° and 90°, 
the difference in the solid angle of sky subtended will be i^°os%o —11.43; thus it is 11.43 times more likely we will see 



the ellipsoid in the larger 9 range. Taking this to infinitesimal differences we have 

|cos((? + 60) — cosO\ = \cos 9 cos 69 — sm9sm66 — cos9\ = sint 



(24) 



So of the ellipsoids under consideration, a fraction fim956 will have their symmetry axes directed at an angle 9 to the 
line-of-sight. Now we can write 

— <-) 

Galaxies with u in the range {u,u + 5u) contribute to the observed f{q)6q galaxies with axial ratios in the range 
{q, q + 6q) accordingly 

n{u)6u sm96q 



^ \6qm 

So the probability density function (PDF) of observed axial ratios (q) is given by 



f{q) = / n{u) 



/ 8in9 \ 

\JsU6e\J 



du 



For an oblate ellipsoid (from Eq. [3]) 



u2(l-cos^ 9)+cos^9 = q^ 



? 9 

COS^ 9 = 



Similarly 



2 • 2 

u sm ( 



1-it^ 



1 — sin 



1 -U2 



It follows that 



6_q 

69 



,2\0.5 



(1 



(1-^2)0.5 (1_ ^2)0.5 



6q 



q 



2m0.5 



Using equations (P7)) . ([?T1) and we have 



fiq) = q 



n{u)du 



[(1-Ii2)(g2_^,2)]0.5 



(26) 

(27) 

(28) 

(29) 

(30) 
(31) 

(32) 
(33) 

(34) 



We now define the intrinsic axial ratio to be a value f3 always less than 1, for an oblate ellipsoid (3 = u. f{q) alone give 
us a PDF of the observed axial ratios, however our data will have to be binned for analysis. So for a given value of (3 
we integrate over a range of observed axial ratios (say qi to (72) to construct a number density function for observed 
axial ratios A^(/3, 52) where 



N[P,quq2) 



1 



12 



qdq 



Considering the integral alone we use the substitution q ~ f3 cos 9 (where 6q = —P sin 969), giving us 



92 



/32 cos 9 sin 9de 

q, ^/?2 C0S2 e - /32 



92 



91 



/32 COS 9 sin 9d9 
i(3 sin 9 



92 



iP cos 9d9 = [il3sm9 + C] 



92 



91 



Rearranging the original substitution and replacing sin 9 accordingly, the integral becomes 

92 



SO for an oblate ellipsoid with a given (3 

Ni/3,qi,q2) 



\/{q'2-P')-^/{q!-P') 



(35) 



(36) 



(37) 



(38) 
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For a prolate ellipsoid f3 — ^ so the integral in (P5|) is multiplied by a factor ^ , thus [3^1 becomes 



«2 



(3^ sin I 



91 



/32 C0S2 e ( COs2 6'-/32) 7qi i COS- 



. sm t/ ^ 
cosf 



1 «2 



«1 



so for a prolate ellipsoid with a given /3 



92 



91 



(39) 



(40) 



the only difference is a factor of the apparent axial ratio being considered, (this makes sense because you would expect 
a prolate object to have a larger number density of small observed axial ratios). Putting this all together, for a function 
n(/3) using oblate ellipsoids we find 



and for prolate ellipsoids 



A^(9i,92) 



^(91,92) 



n{f3) 



n{f3) 



92 



91 



dp 



dp 



(41) 



(42) 



